AITopics | bilinear game

Stochastic min-max optimization has gained interest in the machine learning community with the advancements in GANs and adversarial training. Although game optimization is fairly well understood in the deterministic setting, some issues persist in the stochastic regime. Recent work has shown that stochastic gradient descent-ascent methods such as the optimistic gradient are highly sensitive to noise or can fail to converge. Although alternative strategies exist, they can be prohibitively expensive. We introduce Omega, a method with optimistic-like updates that mitigates the impact of noise by incorporating an EMA of historic gradients in its update rule. We also explore a variation of this algorithm that incorporates momentum. Although we do not provide convergence guarantees, our experiments on stochastic games show that Omega outperforms the optimistic gradient method when applied to linear players.

artificial intelligence, machine learning, omega, (18 more...)

arXiv.org Artificial Intelligence

2306.07905

Country:

North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
North America > Canada > Quebec > Montreal (0.04)
Europe > Russia (0.04)
Asia > Russia (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.56)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)

Add feedback

When is Momentum Extragradient Optimal? A Polynomial-Based Analysis

Kim, Junhyung Lyle, Gidel, Gauthier, Kyrillidis, Anastasios, Pedregosa, Fabian

arXiv.org Artificial IntelligenceFeb-22-2023

The extragradient method has recently gained increasing attention, due to its convergence behavior on smooth games. In $n$-player differentiable games, the eigenvalues of the Jacobian of the vector field are distributed on the complex plane. Thus, compared to classical (i.e., single player) minimization, games exhibit more convoluted dynamics, where the extragradient method succeeds while simple gradient method could fail. Yet, in this work, instead of focusing on a specific problem class, we follow a reverse path: starting from the momentum extragradient method as the selected optimizer, and using polynomial-based analyses, we identify problem subclasses where the use of momentum in extragradient motions lead to optimal performance. Based on the hyperparameter setup, we show that the extragradient with momentum exhibits three different modes of convergence: when the eigenvalues are distributed $i)$ on the real line, $ii)$ both on the real line along with complex conjugates, and $iii)$ only as complex conjugates. We then derive the optimal hyperparameters for each case, and show that it achieves an accelerated convergence rate.

artificial intelligence, convergence rate, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2211.04659

Country:

North America > United States > New York (0.04)
North America > Canada > Quebec > Montreal (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Filters

Collaborating Authors

bilinear game

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

6c7cd904122e623ce625613d6af337c4-AuthorFeedback.pdf

69ce18ad9f53f28e8e7ac1649ae02337-Supplemental-Conference.pdf

69ce18ad9f53f28e8e7ac1649ae02337-Paper-Conference.pdf

Poincaré Recurrence, Cycles and Spurious Equilibria in Gradient-Descent-Ascent for Non-Convex Non-Concave Zero-Sum Games

69ce18ad9f53f28e8e7ac1649ae02337-Supplemental-Conference.pdf

Optimal Extragradient-Based Algorithms for Stochastic Variational Inequalities with Separable Structure Huizhuo Yuan Chris Junchi Li Gauthier Gidel Michael I. Jordan, Quanquan Gu Simon S. Du?

Poincaré Recurrence, Cycles and Spurious Equilibria in Gradient-Descent-Ascent for Non-Convex Non-Concave Zero-Sum Games

6c7cd904122e623ce625613d6af337c4-AuthorFeedback.pdf

Omega: Optimistic EMA Gradients

When is Momentum Extragradient Optimal? A Polynomial-Based Analysis